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The  purpose  of  this  note  is  to  outline  the  need 
for  a  clearing  house  for  computer  programs  of  use  in 
marketing  research  and  to  propose  a  viable  mechanism 
to  meet  this  need. 

THE  PROBLEM 

The  1960's  have  seen  an  accelerated  interest  in 
empirical  research  in  marketing.   One  need  only  peruse 
the  relatively  new  journals  in  the  field  --  the  Journal 
of  Marketing  Research  and  the  Journal  of  Advertising 
Research  --  to  perceive  this  trend.   The  advent  of  the 
third  generation  computer  with  its  greatly  enlarged 
high  speed  storage  as  well  as  time  sharing  capability 
will  no  doubt  give  further  impetus  to  this  trend,  as 
will  the  hoped  for  development  of  a  marketing  data 
bank  and  clearing  house. 

For  purposes  of  discussion  it  would  seem  that 
computer  needs  in  marketing  research  may  be  usefully 
categorized  as  follows: 

I.  Data  Reduction  Systems 

A.  Cleaning  and  Editing  Systems 

B.  Retrieval  and  Summary  Systems 

II.  Statistical  Systems 

A.  Univariate  and  Multivariate  Analysis 

B.  Estimation  Systems  for  Models 

III.  Special  Purpose  Languages 
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These  categories  will  be  considered  below  in  terms  of 
the  need  for  general  availability  of  programs,  current 
examples,  and  future  potential. 
Data  Reduction  Systems 

The  examples  cited  in  this  section  will  tend  to  be 
drawn  from  experience  with  panel  data  files.   The  basic 
notions  and  systems  needs  are,  however,  applicable  to 
any  large,  disaggregated  data  file. 

Any  researcher  who  has  sought  to  use  a  large, 
disaggregated  raw  data  file  is  well  aware  of  what  a 
horrendous  task  it  is  to  prepare  these  data  for 
further  analysis.   The  development  of  systems  to  prepare 
the  data  involves  considerable  manpower  and  financial 
costs  to  say  nothing  of  psychological  costs  in  the 
frustrating  task  of  debugging  a  system.   No  doubt 
these  problems  have  limited  the  use  made  of  such  data 
files  in  the  past. 

The  first  step  in  processing  a  raw  data  file  is 
to  clean  and  edit  the  data.   In  spite  of  the  careful 
and  rigorous  standards  of  quality  control  maintained 
by  the  MRCA  and  the  Chicago  Tribune  panels,  it  is 
important  that  the  data  be  cleaned  and  edited  prior  to 
analysis.   By  cleaning  and  editing  is  meant  screening 
of  the  data  to  make  certain  that  the  data  fields  contain 

information  in  valid  codes,  gross  keypunching  errors 

2 
haven't  occurred,   and  that  the  data  are  in  appropriate, 
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packed   format  for  efficient  processing  later. 
Professors  Ronald  E.  Franl<  and  William  F.  Massy  [3] 
have  developed  a  cleaning  and  editing  package  for 
MRCA  National  Consumer  Panel  and  Chicago  Tribune  Panel 
data.   The  present  version  is  available  in  FORTRAN  II 
with  certain  FAP  subroutines.   As  this  system  is  trans- 
lated into  FORTRAN  iV  and  the  assembly  languages 
appropriate  to  the  third  generation  computers,  it  is 
important  that  an  effective  mechanism  be  developed  for 
making  this  new  package  widely  available  to  the 
marketing  profession. 

Large,  disaggregated  data  files  are  rarely  used 
as  is.   Generally,  the  analyst  must  first  retrieve  and 
summarize  information  from  the  files  prior  to  carrying 
out  his  intended  analysis.   For  example,  if  his  purpose 
is  to  estimate  a  stochastic  model  of  brand  choice,  he 
must  first  retrieve  the  sequence  of  brands  chosen  by 
each  household.   In  any  case,  whether  the  ultimate 
analysis  is  the  estimation  of  a  stochastic  model  or  the 
development  of  a  multivariate  model,  he  will  need  to 
develop  a  programming  system  which  will  transform  the 
cleaned  and  edited  data  into  a  format  appropriate  for 
statistical  analysis.   If  the  models  and  statistical 
procedures  which  are  developing  in  marketing  are  to 
receive  wider  use  within  the  profession,  it  would  seem 
imperative  that  systems  for  retrieval  and  summary  of 
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data  be  developed  and  made  available.   Such  systems 
would  reduce  the  technological,  time,  and  cost  barriers 
to  progress  on  this  front.   As  such  they  would  free  the 
analyst  to  give  greater  consideration  to  the  empirical 
viability  and  relevance  of  the  various  models  and 
research  techniques. 

Once  again  the  Frank-Massy  panel  data  system 
provides  an  example  of  a  system  for  generating  a 
flexible  set  of  summary  measures.   Their  Household 
Purchasing  Characteristics  Generation  System  and  Time 
Series  Generation  System  provide  a  wide  range  of  summary 
data  for  analytic  purposes.   There  is  a  need  in  the 
future  for  more  such  systems  which  generate  measures 
not  currently  available  in  the  Frank-Massy  system. 
In  any  case,  there  is  a  need  for  a  mechanism  to  call 
these  developments  to  the  attention  of  the  marketing 
profession  in  an  efficient  manner.   At  present,  most 
of  this  information  is  disseminated  by  word  of  mouth. 
Statistical  Routines 

There  is  also  a  need  within  the  profession  for 
efficient  and  wide  ranging  interchanging  of  information 
regarding  available  statistical  routines.    In  parti- 
cular, one  can  think  of  three  classes  of  statistical 
applications  where  this  interchange  would  be  most 
useful . 
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Class  I.   Enhanced  Efficiency  or  Generality  of  Standard 
Procedures 

It  would  be  useful  to  have  a  clearing  house  for 
information  on  improved  programs  for  using  standard 
statistical  procedures.   For  example,  the  basic  multi- 
variate analysis  programs  by  Cooley  and  Lohnes  [2  J 
have  been  chained  and  generalized  by  Professor  Charles 
King  of  Purdue  and  also  by  Kenneth  Jones  of  Brandeis 
[6],   They  have  extended  the  utility  and  efficiency  of 
the  basic  programs  by  expanding  the  size  of  the 
problems  which  the  system  can  handle,  providing  new 
features,  and  chaining  the  sub-systems  for  more 
efficient  processing.   It  is  of  clear  interest  to  the 
marketing  profession  to  have  available  the  most  powerful 
and  efficient  systems  for  data  analysis. 
Class  II.   New  Analytic  Procedures 

As  new  methods  for  analyzing  data  are  developed 
and  found  to  be  relevant  in  marketing  applications, 
there  will  be  a  need  for  exchange  of  information  on 
available  programs.   The  utility  of  this  interchange 
will  be  especially  clear  to  anyone  who  has  ever  been 
involved  in  writing  and  debugging  a  program  for  multi- 
variate analysis. 

One  can  foresee  certain  of  these  procedures  on  the 

horizon.   For  example,  Claycamp  and  Massy  [1]  have 

k 
proposed  the  use  of  cluster  analysis   in  market  segmenta- 
tion.  They  report  that  work  is  underway  on  an  empirical 
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application  which  will  result  in  an  operational  cluster 
analysis  program  wh ich  undoubtedly  would  be  of  interest 
to  other  researchers. 

Another  area  where  new  analytic  techniques  and 
programs  will  be  likely  to  develop  is  in  the  area  of 
mu 1 t i -d imens ional  scaling.  The  development  of  multi- 
dimensional scaling  procedures  and  operational  programs 
for  their  implementation  may  be  expected  to  render 
psychometric  procedures  more  useful  and  relevant  in 
marketing  research  in  the  future.   Once  again,  there 
is  a  clear  need  for  a  mechanism  to  disseminate  to  the 
marketing  profession  information  regarding  computer 
systems  which  can  carry  out  such  analyses. 
Class  III.   Estimation  Systems  for  Models 

One  aspect  of  the  increasing  interest  in 
empirical  research  in  marketing  has  been  the  develop- 
ment of  data  based  models,  A  wider  use  of  these 
modeling  techniques  in  the  future  would  be  facilitated 
by  the  dissemination  of  systems  for  estimating  the 
various  models.   For  example,  minimum  chi  square 
procedures  have  been  proposed  for  estimating  and 
testing  several  recent  stochastic  models  of  consumer 
brand  choice.   See  Morrison  [9],  Massy  [?],  and 
Montgomery  [8].   It  would  be  useful  to  have  the 
estimation  programs  for  these  models  be  made  available. 
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Special  Purpose  Languages 

While  such  general  purpose,  problem  oriented 
languages  as  FORTRAN  and  ALGOL  are  readily  available 
to  a  broad  spectrum  of  market  researchers,  there  are 
certain  other  useful  special  purpose  languages  which 
are  not,  perhaps,  so  visible.   For  example,  there  are 
IPL-V,  the  list  processing  language  useful  in  programming 
decision  flow  models,  and  SIMSCRIPT,  a  general  purpose 
simulation  language. 

Once  again,  the  advent  of  the  third  generation 
computer  with  its  capacity  for  on-line  interaction 
between  man  and  machine  promises  to  witness  a  veritable 
explosion  of  new  processing  languages  to  aid  the 
researcher  in  utilizing  the  enormous  capacity  of  these 
new  computers.   A  notable  beginning  has  already  begun 
in  this  direction  with  the  development  of  the  OPS-3 
system  for  on-line  computation  and  simulation  by 
Greenberger,  Jones,  Morris,  and  Ness  [h]   at  M.  I.  T. 

For  the  marketing  profession  to  continue  its 
accelerating  development  in  the  area  of  empirical 
research  and  model  building  --  including  simulation, 
algorithmic,  and  stochastic  models  --  it  will  be 
necessary  for  researchers  in  the  field  to  take  advantage 
of  technological  advances  in  the  state  of  the  research 
art.   A  positive  contribution  to  the  rapid  and  efficient 
dissemination  of  information  regarding  the  relevant 
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dimensions  of  these  technological  advances  in  the 
computer  area  could  be  made  by  a  clearing  house  for 
information  on  these  developments. 
Summary 

In  sum,  it  has  been  suggested  that  a  very  useful  contri- 
bution to  research  in  marketing  could  be  made  by  a 
clearing  house  for  informal  ion  on  data  reduction  programs, 
statistical  packages,  and  special  purpose  computer 
languages.   The  contributions  would  be  greater  efficiency 
in  empirical  research,  a  broader  awareness  within  the 
profession  of  the  tools  available,  and  more  rapid 
diffusion  of  new  techniques  and  models  of  use  in 
empirical  research. 

A  PROPOSED  SOLUTION 
Now  that  the  case  for  a  forum  for  the  exchange 
of  information  regarding  computer  programs  for  data 
reduction  and  statistical  analysis  and  special 
purpose  languages  has  been  developed,  what  can  be 
done  to  meet  this  need?  The  author  would  1  i ke  to 
propose  that  the  Journal  of  Marketing  Research 
establish  a  new  section  entitled  something  like 
"Computer  Applications  in  Marketing  Research"  in 
order  to  provide  the  profession  with  a  forum  for  this 
vital  exchange.   A  successful  model  for  this  under- 
taking is  the  "Computers  in  Behavioral  Science" 
section  of  Behavioral  Science.   Since  I960,  Behav  ioral 
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Sc  ience  has  provided  behavioral  scientists  with 
abstracts  of  computer  programs  of  relevant  to  the 
behavioral  sciences  as  well  as  short  article  length 
discussions  of  major  systems  and  models. 

Why  suggest  the  J MR  as  the  vehicle  for  meeting 
this  need?   First,  in  its  relatively  short  history, 
the  JMR  has  become  a  major  mechanism  for  the  exchange 
of  research  results  and  procedures  among  those 
interested  in  advancing  the  scientific  approach  to 
marketing.   Hence,  it  has  as  its  audience  those  who 
would  benefit  most  from  the  proposed  section.   Second, 
it  has  the  broadest  spectrum  of  interest  in  marketing 
problems  among  the  journals  in  the  field.   Finally,  it 
would  seem  that  a  journal  having  the  editorial  and 
audience  characteristics  of  the  JMR  would  prove  to  be 
the  most  efficient  and  effective  means  for  a  broader 
and  more  rapid  exchange  of  information  in  this  area. 
Format 

The  format  which  such  a  section  would  take  might 
be  as  fol lows : 

A.  Abstracts  of  available  computer  programs 

The  dimensions  of  the  abstracts  are  discussed 
below. 

B,  Short  article  length  papers 

I.   Input/Output  descriptions  for  data 
reduction  systems  which  are  too 
extensive  to  fit  in  the  Abstract 
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sect  ion . 

2.  On  rare  occasions  it  may  be  useful  to 
present  a  short  article  description  of 
less  well  known  statistical  procedures 
and  the  program  available  to  perform 
the  computations.   In  addition,  it  may 
be  helpful  to  provide  notes  on  problems 
and  pitfalls  in  the  application  of 
certain  programs. 

3.  Short  discussions  of  the  major  innovative 
features  and  useful  functions  of  special 
purpose  languages.   These  descriptions 
should  provide  enough  information  to  the 
researcher  to  enable  him  to  determine 
whether  or  not  it  would  be  worthwhile 

for  him  to  explore  the  use  of  the  language 

further. 
Abstracts  of  Computer  Programs 

In  order  for  a  computer  program  abstract  to  be 
published  in  this  proposed  section  of  the  Journal  of 
Marketing  Research  there  should  first  of  all  be  a 
reasonable  likelihood  that  there  will  be  general 
interest  in  it  among  researchers.   Beyond  this  the 
individual  submitting  the  abstract  should  also  provide 
a  concrete  example  of  the  documentation  which  will  be 
made  available.   Such  documentation  should  include  an 
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expanded  version  of  the  abstract  material,  references 
to  statistical  and  mathematical  sources,  flow  charts, 
control  card  formats  and  definitions,  and  a  listing  of 
the  source  program.   The  format  used  in  the  chapters 
of  Cooley  and  Lohnes  [2]  would  be  a  useful  model. 

Editorial  responsibility  would  principally  be  to 
screen  submitted  programs  for  general  relevance  and 
for  evidence  of  adequate  documentation.   In  no  way  can 
or  should  the  JMR  accept  responsibility  for  the 
accuracy  of  the  programs.   Its  function  is  solely  one 
of  disseminating  relevant  information  in  an  efficient 
manner.   It  would  bear  no  responsibility  for  supplying 
the  programs  to  prospective  users  or  for  maintaining 
the  systems.   The  responsibility  for  supplying  documenta- 
tion and  program  decks  must  rest  with  the  researcher 
submitting  the  program  abstract. 

The  items  which  should  appear  in  the  abstracts 
along  with  some  clarifying  comments  are  presented 
below: 
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A.  Program  Title 

B.  Name  of  Indiv idual (s)  Submitting  the  Abstract 

C.  Description 

A  brief  description  of  the  program  in  terms  of 
inputs,  computations,  and  outputs.   It  may  be 
necessary  to  present  summary  comments  on  any 
mathematics  here. 

D.  Capacity  and/or  Limitations 

For  example,  limits  on  the  number  of  variables 
or  the  samp] e  s  ize . 

E.  Running  Time 

May  be  heuristic  or  a  summary  of  experience 
on  representative  problems. 

F.  Machine  and/or  Language  Specifications 

What  language  is  the  system  written  in?  For 
what  machine?  What  systems  idiosyncrac ies 
will  affect  the  transfer  of  the  source  deck 
between  computation  centers? 

G.  Aval labi 1 i  ty 

What  is  available?   Source  deck?  Object 
deck?   Listing?  Where  and  from  whom?   Is 
there  any  charge?   If  so,  how  much?   Does 
the  source  wish  to  serve  as  the  focal  point 
for  program  maintenance? 
CONCLUSION 
This  note  has  outlined  a  need  for  a  forum  for 
the  exchange  of  information  regarding  computer  appl ica- 
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cations  in  marketing  research  and  has  proposed  what 
would  prove  to  be  an  efficient^  effective,  and  viable 
method  to  meet  this  need.   The  establishment  of  such 
a  vehicle  should  also  serve  as  a  considerable  incentive 
for  researchers  in  the  field  to  document  their  systems 
well  and  to  make  them  more  generally  available  to  other 
interested  researchers.   It  is  the  hope  of  the  present 
author  that  this  proposal  may  serve  as  a  catalyst 
which  will  soon  result  in  such  a  forum  coming  into 
be  ing . 

In  conclusion,  perhaps  it  would  be  well  to  note 
that  it  would  be  possible  to  broaden  the  scope  of  the 
proposed  section  to  include  abstracts  of  data  which 
are  available  for  academic  research,  giving  information 
as  to  the  source  of  such  data,  cost,  restrictions, 
format,  etc.   Such  abstracts  could  serve  as  valuable 
adjuncts  to  a  marketing  data  bank  or  perhaps  even 
serve  as  a  surrogate  for  a  data  bank. 
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FOOTNOTES 


•"The  author  is  Assistant  Professor  of  Management  in  the 
Alfred  P.  Sloan  School  of  Management,  Massachusetts 
Institute  of  Technology. 

'see  [5]. 

2 

For  example,  the  price  per  unit  should  be  reasonable 

for  the  product  involved.   Thus  a  price  of  $5.00  per 

pound  of  coffee  would  clearly  be  in  error,  and  the 

screening  and  editing  program  would  reject  such  a 
data  record. 

3 

By  packing  is  meant  the  procedure  of  storing  several 

individual  records  in  one  logical  blocl<  on  a  magnetic 
tape.   Since  large  scale  computers  are  generally 
input/output  bound  when  dealing  with  large  data  files, 
considerable  computer  time  savings  may  be  realized  by 
using  packed  records  for  input/output  purposes. 

k 
The  growing  interest  in  this  technique  is  exemplified 

in  the  fact  that  Professors  Ronald  Frank  and  Paul 

Green  have  also  proposed  an  application  of  cluster 

analysis  in  marketing. 

For  example,  the  author  and  several  others  have  had 
some  difficulty  with  a  widely  circulated  multiple 
discriminant  program  which  sometimes  yields  large 
negative  eigenvalues  from  a  positive  semi -def in i te 
matrix,  a  result  which  is  theoretically  impossible. 
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